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(54) Title: A STANDALONE INTERACTIVE TOY 
(57) Abstract 

A standalone interactive toy (1 1) having an adaptive personality 
and utilizing artificial intelligence which responds to the user and 
which has learning capabilities. The toy is capable of initiating 
communication with the user, adapting to the user and developing 
an individual "personality" through interaction with the user. The 
standalone interactive toy includes a plurality of characteristics which 
include, inter alia, a plurality of interaction characterizations between 
the user and the toy, means for receiving and characterizing vocal input 
from the user, means for changing the adaptive personality thereby 
creating a current personality characteristic for the toy, memory 
storage means, means for generating a response which is responsive to 
the characterized vocal input and the current personality characteristic 
of the toy and the audio output means for outputting the response. 



11 




FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


hah? 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cdte dM voire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/32203 



PCT/IL98/00617 



A STANDALONE INTERACTIVE TOY 

FIELD OF THE INVENTION 

The present invention relates to talking toys generally and more 
particularly to an improved interactive toy with artificial intelligence capable of 
5 learning from its interaction with a child. 

BACKGROUND OF THE INVENTION 

It has long been recognized that for a toy doll or figure to appear more 
lifelike to a child, there should be interaction between the child and the figure. It 
is known to utilize simulated speech which produce sounds in response to 
10 movement of the figure or by selecting pre-recorded messages in response to 
activation by the user. 

US Patent No. 4,696,653 to McKeefery describes a toy doll which 
responds with spoken words or phrases to the touching of selected areas of the 
doll by the user or in response to the user's voice. The doll responds to being 
15 touched on specific parts of its body by one of two phrases related to the 
particular part of the doll's body being touched. When the user speaks to the 
doll, the doll replies with a randomly selected sentences. 

US Patent No. 4,923,428 to Curran describes an interactive talking toy 
which simulates interaction with the child by appearing to respond to input from 
20 the child. The speech and movement of body parts of the toy are controlled by 
multiple audio tracks. The particular audio track reproduced at any time is 
directed by a processor on the basis of a response from a human. 

The existing prior art is restricted to passively responding to physical or 
vocal activation by the user. Further, existing interactive toys are limited in their 
25 possible responses which reduces the interest of the child after a short period. 
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SUMMARY OF THE INVENTION 

An object of the present invention is to provide a standalone interactive 
toy such as a doll configured to talk to the user and respond to vocal input from 
the user. 

5 A further object of the present invention is to provide an interactive toy 

utilizing artificial intelligence which responds to the user and which has learning 
capabilities. The toy is capable of initiating communication with the user, adapting 
to the user and developing an individual "personality" and other characteristics 
through interaction with the user. 

10 There is thus provided, in accordance with a preferred embodiment of 

the invention, a standalone interactive toy having an adaptive personality which 
includes a plurality of characteristics, the plurality of characteristics representing at 
least one of a group including a plurality of personality characteristics of the toy, a 
plurality of personality characteristics of an interactive user and a plurality of 

15 interaction characterizations between the user and the toy; means for receiving 
and characterizing vocal input from the user; means for changing the adaptive 
personality thereby creating a current personality characteristic for the toy; 
memory storage means for storing the current personality characteristic of the toy 
and the plurality of characteristics; means for generating a response, the 

20 generated response being responsive to the characterized vocal input and the 
current personality characteristic of the toy and audio output means for outputting 
the response. 

Furthermore, in accordance with a preferred embodiment of the 
invention, the standalone interactive toy further includes means for amending the 

25 plurality of characteristic means according to at least one of a group of 
environment conditions including random changes, time based changes, direct 
and indirect feedback from the interacting user, periodic changes and 
programmed changes. 

In addition, in accordance with a preferred embodiment of the invention, 

30 the standalone interactive toy further includes decision making means wherein the 
generated response is responsive to the decision making means. The decision 
making means is adjustable in accordance with the vocal input from the user 
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and/or is adjustable in accordance with the usage of each of the plurality of 
characteristics. Also, the decision making means are determined by one of a 
group of conditions including randomization, weighted randomization, the usage 
of each of the plurality of characteristics and on the basis of which of the plurality 
5 of characteristics has been recently selected. 

Furthermore, in accordance with a preferred embodiment of the 
invention, the toy further includes a plurality of question types and means for 
determining which of the plurality of question types is to be asked by the toy. 

Furthermore, in accordance with a preferred embodiment of the 
10 invention, the toy further includes means for assessing the interactive user's 
response to each of the plurality of question types, for determining whether a 
subsequent question should be asked and means for formulating the subsequent 
question to ask. The interactive user's response includes one of a group of 
responses including non-response or non-reaction within a pre-determined period 
15 Furthermore, in accordance with a preferred embodiment of the 

invention, the toy further includes means for determining the subsequent step to 
take after a specific activity has ended and to decide whether and how to interrupt 
an ongoing activity. 

Furthermore, in accordance with a preferred embodiment of the 
20 invention, the initial vocal response is generated on the toy being powered up. 

Furthermore, in accordance with a preferred embodiment of the 
invention, the processor is any one of a group of processors including a central 
processing unit (CPU), a digital signal processor (DSP), microprocessor or 
micro-controller. 

25 Additionally, in accordance with a preferred embodiment of the 

invention, the memory storage means includes at least one of a group of memory 
storage units including a ROM (read only memory) unit, a ROM paging unit, and a 
RAM (random access memory) unit. The memory storage means includes 
interactive computer games. 

30 Furthermore, in accordance with a preferred embodiment of the 

invention, the standalone interactive toy further includes a non-volatile memory 
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unit. The non-volatile memory unit includes at least one of a group including flash 

memory and EEPROM memory. 

Furthermore, in accordance with a preferred embodiment of the 

invention, the standalone interactive toy further includes a replaceable cartridge 
5 unit and/or a PC computer connectable to the processor. The cartridge unit 

and/or a PC computer includes interactive computer games. Also the cartridge 

unit and/or a PC computer includes at least one of a group of memory storage 

units including flash memory, EEPROM memory and ROM memory. The 

interactive computer games are time and date dependent. 
10 A standalone interactive toy according to claim 19 and further includes a 

communication link coupled to the processor and an external computer coupled to 

the communication link. 

Furthermore, in accordance with a preferred embodiment of the 

invention, the standalone interactive toy further includes at least one of a group of 
15 actuators responsive to activation by the user including a plurality of sensors, 

indicators and a communication link connected to the processor. The sensors 

comprise touch sensitive contacts having specific operating functions. 

Additionally, in accordance with a preferred embodiment of the 

invention, the ROM (read only memory) unit includes at least one of a group 
20 including: a library of recognition words; software for activating the recognition 

words library, a synthesizer library; software for activating the synthesizer library, 

application software for controlling and activating the doll, and a text database 

storing data for the use with the vocal input converting means. 

Furthermore, in accordance with a preferred embodiment of the 
25 invention, the vocal input converting means includes a text-to-speech (TTS) 

converter. 

Additionally, in accordance with a preferred embodiment of the 
invention, the processor includes at least one of a group of including: modules for 
speech recognition, artificial intelligence , music synthesis , speech synthesis and 
30 digital sound processing. 
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Furthermore, in accordance with a preferred embodiment of the 
invention, the standalone interactive toy further includes a clock and a scheduler 
for scheduling games and activities dependent on time and date. 

In addition, there is thus provided, in accordance with a preferred 
5 embodiment of the invention, a method of interactive communication between an 
adaptive standalone interactive toy and a user, the method comprising the steps 
of: 

creating a plurality of characteristics, the plurality of characteristics 
representing at least one of a group including a plurality of personality 
10 characteristics of the toy, a plurality of personality characteristics of an interactive 
user and a plurality of interaction characterizations between the user and the toy; 
creating a current personality characteristic for the toy; 
receiving and characterizing the vocal input from the user; 
storing the current personality characteristic of the toy and the plurality of 
15 characteristics; 

generating a response, the generated response being responsive to the 
characterized vocal input and the current personality characteristic of the toy; and 
outputting the response. 

Furthermore, in accordance with a preferred embodiment of the 
20 invention, the method of interactive communication further includes the step of 
amending the plurality of characteristic means according to at least one of a group 
of environment conditions including random changes, time based changes, direct 
and indirect feedback from said interacting user, periodic changes and 
programmed changes. 
25 Furthermore, in accordance with a preferred embodiment of the 

invention, the method of interactive communication further includes the step of 
decision making, the step of generating a response being responsive to the step 
of decision making. 

Additionally, in accordance with a preferred embodiment of the 
30 invention, the method of interactive communication further includes the step of 
determining whether a subsequent question should be asked; and 
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if so, determining which of a plurality of question types is to be asked by 

said toy. 

Furthermore, in accordance with a preferred embodiment of the 
invention, the method further includes the step of assessing said interactive user's 
5 response to each of said plurality of question types and the step of formulating the 
subsequent question to ask. 

Furthermore, in accordance with a preferred embodiment of the 
invention, the method further includes the step of determining what step to take 
after a specific activity has ended and/or determining whether and how to interrupt 
10 an ongoing activity. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be understood and appreciated more fully 
from the following detailed description taken in conjunction with the appended 
drawings in which: 

5 Fig. 1A illustrates an interactive speaking doll, in accordance with a 

preferred embodiment of the invention; 

Fig. 1B is a high level block diagram illustration of the main elements of 
the interactive speaking doll of Fig. 1A; 

Fig. 2 is a block diagram illustration of the electronic components of the 
10 doll of Fig. 1 A; 

Fig. 3 is a schematic representation of the software application 
components which are processed by the electronic components of the doll of Fig. 
1; 

Fig. 4 is a schematic representation of the contents of the ROM and 
15 RAM units of the electronic components of the doll of Fig. 1 ; 

Fig. 5 is a high level flow chart illustration of the question asking 
mechanism; and 

Figs. 6A, 6B and 6C are flow chart diagram illustrations of the functional 
operation of the doll of Fig. 1 . 

20 
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DETAILED DESCRIPTION OF THE PRESENT INVENTION 

Reference is now made to Figs. 1A, 1B and 2 which illustrate the 
interactive speaking doll, generally referenced 11, in accordance with a preferred 
embodiment of the invention. 

5 The mechanism, generally designated 10, for operating the interactive 

speaking doll in accordance with a preferred embodiment of the invention is 
embodied within the body 12 of doll 11, illustrated in Fig. 1A. Fig. 1B is a high 
level block diagram illustration of the main elements of the interactive speaking 
doll 11. Fig. 2 is a block diagram illustration of the electronic components of the 

10 mechanism 10. The mechanism 10 is controlled by a processor 20. For the 
purposes of example only, processor 20 is hereinbelow referred to as a central 
processing unit (CPU). The processor 20 may include any processing unit 
including a central processing unit (CPU), a digital signal processor (DSP), 
microprocessor or micro-controller. 

15 The interactive speaking doll 1 1 is configured to talk to the user and to 

respond to the vocal commands of the user via text-to-speech (TTS) and voice 
recognition software, respectively, generated by means of the processor 20. The 
interactive speaking doll 11 is a standalone doll and does not require to be 
connected to another unit, such as a computer or video game to operate. 

20 A feature of the interactive speaking doll 1 1 of the present invention is its 

use of artificial intelligence (Al) and its machine learning capabilities. The doll 1 1 
is capable of initiating communication with the user and adapting itself and 
developing an individual "personality" through interaction with the user. The 
various features of the doll will be described in detail hereinbelow. 

25 The mechanism 10 comprises a microphone 14 (for audio input) and a 

speaker 16 (for audio output) linked via a codec 18 to processor 20. 

Processor 20 comprises a vocal input converter 90, a response 
generator 92 and a text to speech (TTS) converter 94. 

The electronic components of the mechanism 10 further comprise 

30 memory storage means, generally designated 19 (Fig. 1A) comprising a ROM 
(read only memory) unit 22, a ROM paging unit 24, a RAM (random access 
memory) unit 26 and a non-volatile memory unit 28 (best seen in Fig.2). In 
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addition, the doll 11 comprises a plurality of sensors 30, indicators 32 and a 
communication link 34, all of which are connected to the central processing unit 
(CPU) 20. 

The mechanism 10 is controlled by a power control unit 36 and powered 
5 by a battery power supply 37. The mechanism 10 further comprises clock units 
42 and 43 for controlling the timing of the processor 20 and codec 18, 
respectively. Optionally, a cartridge unit 38 and/or a computer (PC) 40 may also 
be connected to processor 20. Optionally, communication link 34 can be coupled 
to an external computer (PC) 35. 
10 A microphone amplifier 44 is connected between microphone 14 and 

codec 18 for amplifying the analog signals prior to being decoded. A second 
amplifier 45 is used to amplify the doll's voice signals being broadcast via the 
speaker 16. 

Codec 18 is a standard known in the art coder/decoder device for 
15 converting analog signals to digital for transmission and vice versa for converting 
digital signals to analog. 

The sensors 30 comprise touch sensitive contacts, generally designated 
46, which are disposed beneath various parts of the doll 1 1 , such as the eyes, 
ears and mouth, for example. The sensors 30 may also optionally comprise 
20 buttons, generally designated 48 having specific operating functions, which may 
be activated by the user. 

The indicators 32 may comprise light emitting diodes (LEDs) to attract 
the user's attention to the doll 1 1 . 

Reference is now also made to Figs. 3 and 4. Fig. 3 is a schematic 
25 representation of the various software application components which are 
processed by processor 20. Fig. 4 is a schematic representation of the contents 
of the ROM unit 22 and RAM unit 26. 

Processor 20 is configured to process software applications and 
comprises modules for speech recognition 50, speech synthesis 52, music 
30 synthesis 54, the generation of sound effects 56, artificial intelligence 58 and 
digital sound processing 60. 
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The vocal input converter 90 (Fig. 1B) of the processor 20 analyzes and 
processes the speech of the user using the recognition module 50 and speech 
recognition program 64 in conjunction with the word library 62. 

Response generator 92 utilizes software driven algorithms to decide on 
the appropriate response. The "decision making" algorithms, which are described 
in greater detail below, utilize techniques from control theory and machine 
learning to render an appropriate response. 

The text to speech (TTS) converter 94 utilizes a combination of speech 
synthesis 52, music synthesis 54, sound effect blocks 56 together with the 
synthesizer library 66 and synthesizer software 68 to output the response via 
speaker 16. 

The memory space available in ROM unit 22 is suitably partitioned and 
is sufficiently large to be store the required libraries and programs. ROM unit 22 
comprises at least the following elements: 

a) a library of recognition words 62 and software 64 for activating the 
recognition words library 62, 

b) a synthesizer library 66 and software 68 for activating the 
synthesizer library, 

c) application software 70 for controlling and activating the doll, and 

d) a text database 72 for the use with the Text-to-Speech (TTS) 
software. 

The memory space available in RAM unit 26 is suitably , partitioned into a 
main application work area 74, a synthesis work area 76 and a recognition work 
area 78 for use by the response generator 92, the text to speech (TTS) converter 
94 and the vocal input converter 90. 

The recognition words library 62 comprises a database containing 
strings of words or phrases. The extent of the database and the capability of the 
speech recognition software 50 are dependent upon the processing speed and 
memory capacity available. 

Speech recognition software 50, such as the automatic speech 
recognition (ASR) products from Lemout and Hauspie of Massachusetts, USA, 
are commercially available. Briefly, automatic speech recognition (ASR) is a 
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process by which machines understand spoken output and is generally capable of 
recognizing natural and fluently spoken words continuous digits, isolated words, 
key words and the alphabet. 

Automatic speech recognition is speaker independent. That is, it is 
5 capable of Being used by different speakers without speaker involvement. ASR 
can adapt to changes in background knowledge level and the users manner of 
speech pronunciation tones of speech and pitch. In addition to recognizing 
individual words the ASR also has the capability of processing and recognizing a 
string of naturally and continuously spoken words. 

10 A further feature of ASR is its capability to recognize isolated words and 

to identify key words from a group of surrounding words from other sounds and 
continuous speech. For example the ASR could recognize the word "song" from 
a phrase such as "I want a happy song". 

The text to speech (TTS) software converts computer readable text 

15 stored in the TTS database 72 into computer generated speech. Preferably the 
text to speech is phoneme based, capable of reading any word whose 
pronunciation is consistent with the general rules of pronunciation for the 
particular language. The available vocabulary is only limited by the capabilities of 
the processor and size of the memory storage. TTS attempts to mimic natural 

20 sounding speech and an alternative embodiment the speech output (speaker 
volume, speech rate and speech pitch) is adjustable by the processor 20. 

Text to speech (TTS) modules are commercially available, for example 
the TTS3000 module from Lernout and Hauspie, Massachusetts, USA. 

The use of a processor 20 together with text-to-speech (TTS) and voice 

25 recognition software as integral components of the doll 11, allows the doll to 
interact with the user. Specifically, the response generator 92 of the interactive 
speaking doll 1 1 of the invention utilizes algorithms and functions which include 
the following elements: 

1 . Personality Characteristic development; 

30 2. Decision Making algorithms; 

3. Question and Answer mechanisms; and 

4. What Next (WN) mechanism 
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These specific features will now be described. 

Personality Characteristic Development 

Using the principles of Vector Quantization, that is using a short vector 
of numbers to represent a complex system, each of the personality 
'characteristics' of the doll and of the user can be represented by a vector of 
numbers. For example, a personality characteristic, such as the sense of humor 
or the intelligence of the doll can be represented by a number from 0 to 100, 
where 0 means "not at all" and 100 means "the maximum". 

Similarly, personality characteristics which describe the user of the doll, 
such as an estimate of the user's intelligence or how much the user likes jokes or 
songs, can also represented by a number from 0 to 100. 

The development of the doll's personality is adaptive, that is, its 
personality changes and adapts to the responses made by the user and 
'environment' changes. Every response of the doll can be related to one or more 
of characteristics of the doll and the user. For example, if the doll is deciding 
whether to yawn or not, it will take into account the 'tiredness characteristic' of the 
doll and how much the user likes non-regular responses (that is, interruptions in 
the regular flow of events). 

Non-limiting examples of "characteristics" include: 

A. Patience - When the doll is impatient it will only accept one wrong 
answer in response to a question, and when the doll is patient it will 
allow the user several tries to find the correct answer. 

B. Tiredness - When the doll is tired it will be less active, that is, it will 
initiate less and will yawn every now and then. 

C. Doll's Amusement - When the doll is amused it will laugh more 
often, try to initiate funnier activities and will act in a more pleasant 
manner and be "nicer" to the user. 

D. User's intelligence - If the user's intelligence is considered to be 
high, he will be asked tougher questions and the doll will assume 
that the user knows what he wants well. For example, if the doll 
has to decide whether to accept the user's proposal for a game, it 
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will give a higher consideration to a suggestion made by an 
"intelligent" user than an "unintelligent" one. 
The various characteristics are reflected in the behavior of the doll, the 
way the doll treats the user and the agenda of the doll. Every characteristic can 
5 be reflected in any or all of the above. 

All characteristics are stored in the non-volatile memory unit 28 such as 
an EEPROM, so the doll's actions and characteristic behavior continue 
uninterrupted, even after being turned off. 

To emulate a 'personality 1 , the characteristics are constantly subject to 
10 change according to 'environment* measurements, which may be listed as follows: 

1. Random changes - which represent "moods". For example, the doll 
can become sick or angry for no reason. If the doll becomes sick, several 
characteristics are changed. The decision on becoming sick is random and thus 
after the doll decides to be sick, the "tiredness" characteristic will increased, and 

15 the "amusement" characteristic will decrease, for example. 

2. Time based (or periodic) changes - the doll can become more tired 
later in the day, for instance; 

3. Programmed changes - the doll becomes impatient as directed by 
software programmed commands. The doll can be programmed to have certain 

20 "feelings". For example, in a game which requires foolish behavior, the doll's 
'amusement 1 characteristic is increased so that the overall behavior of the doll will 
become more. amusing and funny. 

4. Direct Feedback - from the user. For example, if the user answers 
difficult questions correctly, the user's intelligence will increase; and 

25 5. Indirect Feedback - from the user can indicate if decisions that were 

made in acceptance with certain characteristics in the past are liked by the user. 
For example, if the doll plays a song and the user doesn't respond after a time, it 
is deduced that the user doesn't like songs and therefore the user's "like songs" 
characteristic is reduced. 

30 Both the Direct and Indirect Feedback models also affect characteristics 

which relate to interaction between user and the doll. In the case of the Indirect 
Feedback model, the user's response is generally delayed. 
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The various characteristics may vary in their rate of change, that is how 
much a measurement affects the value of the characteristic. The characteristic 
has various properties such as 'confidence level' which represent how much the 
value can be trusted and other information. 

5 Decision Making 

A further feature of the interactive toy, is the response generator 92 
which activates a software driven component that performs "decision making". 
The "decision making" algorithms are weighted according to feedback and the 
actual usage of the dolls characteristics to customize decisions to the specific doll 
10 and the user. Techniques from control theory and machine learning are utilized to 
adapt the "decision making" algorithms. The weights of decisions and the 
decisions taken are stored in the non-volatile memory 28. 

The "decision making" weighting mechanism allows for different options 
to be chosen. For example, the decision to which of three stories, "A", "B" or "C", 
15 should be read, can be linked to the "humor' characteristic of the doll. The 
probability of choosing a particular story such as "A", which is rated as "funny", will 
be increased (or reduced) depending on the mood of the doll. Thus, if story A is 
classed as 'funny 1 , when the doll is in a humorous mood, the weight of story "A" 
will be increased by the "humor" characteristic 
20 There are several possible "decision making" strategies, non-limiting 

examples of which include: 

1 .Choosing at random (All options have the same probability) 
2.Choosing at random where the probability of choosing an option is 
proportional to ifs weight. 
25 3.Choosing the highest/lowest weight option. 

4.Choosing options that were not chosen lately (buffered n-length 

FIFO). 

After a decision has been made and feedback measured (either directly 
or indirectly), the basic weight of the options can be updated. The "weighting" can 
30 be stored in the non-volatile memory 28. 
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Question and Answer Mechanisms 

The type of questions which the doll asks the user are defined by known 
Artificial Intelligence (Al) mechanisms which enables the automatic handling of 
questions. Additionally, characteristics which represent the user's ability to 
understand questions are taken into account when determining the type of 
question to ask. 

Non-limiting examples of the types of questions which the doll can ask 
the user are as follows: 

1 . "Open" question where any answer is acceptable. 

2. "Yes/No" question - Only a yes or no is an acceptable answer. 

3. "Open Menu" questions - There are several known answers that are 
either correct or relevant. An "Open Menu" question is asked without naming any 
of the possible answers; for example: "what is the color of my hat n . 

4. "Enumerate Menu" Questions - There are several known answers 
as in "Open Menu" question. The question asked also gives some of the possible 
answers ; for example: "what is the color of my hat, red blue or green ? n . 

5. "Yes/No Menu" questions - There are several known answers as in 
"Open Menu" questions and in this case, the question asked is followed by asking 
a 'Yes /No' question; for example: "what is the color of my hat, is it red T 

For "Yes/No" questions, the questions are asked directly. The- Artificial 
Intelligence mechanism learns whether the user understands how to answer 
Yes/No type question. If the user does not respond as expected, then a short 
explanation (there are several explanation according to the user's behavior and 
past) is added to the question. 

For "Menu" type Questions (Open, Enumerate or Yes No), the algorithm 
makes a decision based on the question asked, the list of relevant and correct 
answers (a "menu") and the maximal number of times that the question can be 
asked. Reference is now made to Fig. 5, which is a high level flow chart 
illustration of the question asking mechanism. 

The initial type of question to be asked (step 102) is determined, based 
upon the user's intelligence, local and global understanding and the last question 
asked. The type of question is selected from the group of either "Open", 
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"Enumerate" or "Yes/No" questions (step 104) The operational flow then 
continues depending on the answer given (query box 105). 

If the correct answer is given, the relevant databases including 
characteristics database are updated (step 106). If the wrong answer is given, a 
decision made whether to ask the question again (query box 108). The decision 
is made according to the maximal number of questions allowed, upon the user's 
intelligence and understanding and the doll's characteristics (such as patience). 

If a follow-up question is to asked, the kind of question to be asked 
(Open, Enumerate or Yes/No) (step 104)) is determined according to the user's 
intelligence and understanding (step 110) and the above loop steps 104 -110 
continues. 

If a follow-up question is not being asked, the relevant databases are 
updated (step 106). 

During updating (step 106), other parameters, including the number of 
consecutive questions not having a response or the number of Yes/No questions 
answered incorrectly are also updated. 

What Next fWN) Mechanism 

A further feature of the interactive toy, is a software driven mechanism 
called the "What Next" mechanism which allows the doll 1 1 to decide what to do 
after a specific activity ended and decide whether and how to interrupt an ongoing 
activity. Briefly, the "What Next" mechanism uses a technique called "RuleBase", 
which is based on prior gained knowledge of the user and which checks any of a 
plurality of possible conditions have occurred and responds accordingly. 

For example, in deciding what action to take whenever a specific activity 
has ended, the various operational steps of the "What Next" mechanism can be 
described as follows: 

A. Non-User Response : If the user did not react or respond within a 
pre-determined period, for example, if the user didn't either switch 
the doll off or call the doll or verbally address the doll to suggest a 
new activity, the mood of the doll 11 can be changed? 
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B. Mood Changes : A check is made to determine periodic or random 
changes in mood. Preferably, mood changes are not changed 
constantly, but rather only a pre-determined time period. 

C. Action Determination : An action decision is made. For example, 
the action decision may include whether to make a funny remark, 
asking the user to decide what to do or suggesting an activity to the 
user. 

i) If the user is asked to decide for himself, the question asked by 
the doll is weighted towards the user's preferred activities. The 
doll decides whether accept the user's suggestion, according to 
the mood of the doll and/or the user's recent behavior. 

ii) If the doll suggests a new activity to the user, activities that were 
not used recently are suggested first. Alternatively, the doll may 
insist on playing a certain activity if it is in a stubborn mood, for 
example. 

Reference is now made to the flow chart diagram of Figs. 6A - 6C, which 
are flow chart illustrations of an embodiment of the functional operation of the doll 
11. 

Once the doll 1 1 is switched on (step 200), initially by an on/off switch 
connected to the power control 36, the doll 1 1 is activated for play (step 202). 

There are two principal modes of operation, vocal (step 204) and 
non-vocal (step 206). Vocal activation may be either initiated by the doll itself 
(step208) or by the user (child) (step 210). Non-vocal operation is initiated by the 
user activating one of the sensors 30 or buttons 48. 

After the doll 11 initiates a request (step 208), a sub-routine 211 (Fig. 
6B) is performed. The doll 1 1 will wait for the user to respond (query box 212). If 
there is a hiatus in the play between the doll and the child, after a pre-selected 
period of time, the doll will make the selection (step 214). Depending on the 
routine logic, the routine will wait for the user's response (query box 213) or return 
to central control ((step 215). 

If the user responds to the doll's question, the response is analyzed and 
the appropriate action taken (step 216). Having completed the particular 
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sub-routine, the routine returns to the central control (202) to await further 
response (vocal or non-vocal). 

Depending upon how the child answers will depend upon the 
development of the game or operation. The type and length of question is 

5 determined by the programming and can, for example, be a single word such as 
"hello", or a longer phrase such as "How are you?", "Who are you?", and "What 
game do you want to play today?' 'Let me give you a choice". 

The doll can be configured so that the by default the doll initially waits for 
a pre-selected time period for the user to start a conversation. For example, if the 

10 child wishes to play with the doll it could activate the doll vocally (step 210) by 
calling it by name. If the child uses a longer phrase or a couple of words strung 
together, the doll will react according to how the child itself makes the request. In 
other words, the doll interacts with the child. If, for example, the child says "play 
with me", the doll may well inquire "what do you want to play? - A game?" - "or do 

15 you want me to sing a song?". 

The response the doll makes can also be time and date dependent. For 
instance, the doll can greet the user on her birthday by singing "happy birthday" or 
suggest playing a specific game on a specific date such as Christmas. 

If the child (user) talks to the doll 11, a sub-routine 217 (Fig. 6C) is 

20 performed. The child's vocal request is analyzed and processed by the speech 
recognition module 50 (step 218). If the word or phrase is 'understood', that is 
recognized by the speech recognition program 64 in conjunction with the word 
library 62, the doll 11 will decide on a response (step 220). The doll's response 
will depend on the application program, as described in detail hereinabove (Fig. 

25 5). The response may be in the form of a question to the user or the doll may 
begin 'playing the game' or 'singing the song' requested, for example. The main 
processing blocks used for the doll to speak include the speech synthesis 52, 
music synthesis 54, generating sound effects 56 blocks and the synthesizer 
library 66 and synthesizer software 68. 

30 If the speech recognition module 50 does not match the user's speech 

with its word library 62, the speech recognition program 64 can "recognize" the 
new words spoken and store them in the non-volatile memory unit 28 (step 222). 
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The speech recognition module 50 is basically a combination of 
recognition engines, language and environment dependent databases and 
application programming interfaces. The recognition engine has four processing 
blocks; data acquisition and pre-processing, feature extraction, acoustic match 

5 and dynamic programming. 

Data acquisition converts the spoken analog signal into digital form. 
Pre-processing such as automatic game control, echo cancellation and voice 
activity detection, for example, may be applied to the signal. After frequency 
analysis of the signal, the relevant acoustic phonetic information is captured in a 

10 feature vector every few msecs. Algorithms are adapted to recording channel and 
to suppression of background noise. 

The acoustic match block uses a probabilistic matching score which is 
computed between each feature vector to describe the smallest acoustic phonetic 
units which are know to the recognizer, such as context independent and context 

15 dependent phonemes and word models. The dynamic programming block finds 
the best match and alignment for the speech input on a word and sentence level 
by accumulating the acoustic scores over time within the restrictions imposed by 
lexical and syntactic constraints. 

Briefly, the text to speech (TTS) technology converts computer readable 

20 text stored in the TTS database 72 into synthetic speech in stages, first converting 
the text into a phonetic transcription, calculating the speech parameters and finally 
using these parameters to generate synthetic speech signals. In addition, the text 
to speech (TTS) technology is also capable of adding vocal and tonal 
enhancements, referred to as 'improved TTS'. 

25 The doll may be powered up initially by the on/off switch connected to 

the power control 36 (step 200) and activated non-vocally by pressing one of the 
plurality of operationally defined buttons 48 (steps 230, 232). A plurality of 
buttons 48, each of which has a different function, can be placed on the doll 11 in 
any suitable position, such as a belt 80 around the doll's waist (Fig. 1). Buttons 48 

30 can be configured with operational functions, such as "stop", "continue", "on", "off. 
In addition, buttons can be configured for specific reaction functions, for example 
"sing a song", or "play a game". 
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Preferably, the doll 11 is configured to switch itself off after a 
pre-determined period inactivity, that is not receiving any reaction from the 
child/user. Thus, after the doll has completed a specific routine (as described 
hereinabove with reference to Figs. 5B and 5C), the non-vocal routine, for 
example, can be used to terminate operations. Referring to Fig. 5A, the 
non-vocal routine awaits a user response (steps 230, 232). If a button is not 
pressed within a pre-determined time period (loop steps 234, 230), the program 
terminates (236). 

It will be appreciated by persons knowledgeable in the art that the above 
description of an interactive toy is not restricted to a 'talking' doll but is applicable 
to any toy or game. Furthermore, the functions and routines described are given 
by way of example only and are restricted thereto. By means of PC 40 and 
cartridge unit 38, it is possible to download additional units containing speech and 
music modules or other games, for example. 

It will be further appreciated that the present invention is not limited by 
what has been described hereinabove and that numerous modifications, all of 
which fall within the scope of the present invention, exist. Rather the scope of the 
invention is defined by the claims which follow: 
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CLAIMS 

1. A standalone interactive toy having an adaptive personality comprising: 

a plurality of characteristics, said plurality of characteristics 
representing at least one of a group including a plurality of personality 
characteristics of the toy, a plurality of personality characteristics of 
an interactive user and a plurality of interaction characterizations 
between said user and the toy; 

means for receiving and characterizing vocal input from said user; 

means for changing said adaptive personality thereby creating a 
current personality characteristic for said toy; 

memory storage means for storing said current personality 

characteristic of said toy and said plurality of characteristics; 

» 

means for generating a response, said generated response being 
responsive to said characterized vocal input and said current 
personality characteristic of said toy; and 

audio output means for outputting said response. 

2 . A standalone interactive toy according to claim 1 and further comprising 
means for amending said plurality of characteristic means according to at least 
one of a group of environment conditions including random changes, time 
based changes, direct and indirect feedback from said interacting user, periodic 
changes and programmed changes. 

3 . A standalone interactive toy according to claim 1 further comprising 
decision making means wherein said generated response is responsive to said 
decision making means. 

4 . A standalone interactive toy according to claim 3 wherein said decision 
making means is adjustable in accordance with the vocal input from said user. 

5 . A standalone interactive toy according to claim 3 wherein said decision 
making means is adjustable in accordance with said usage of each of said 
plurality of characteristics. 
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6 . A standalone interactive toy according to claim 3 wherein said decision 
making means are determined by one of a group of conditions including 
randomization, weighted randomization, said usage of each of said plurality of 
characteristics and on the basis of which said plurality of characteristics has 
been recently selected. 

7 . A standalone interactive toy according to claim 1 and further comprising a 
plurality of question types and means for determining which of said plurality of 
question types is to be asked by said toy. 

8 . A standalone interactive toy according to claim 7 and further comprising 
means for assessing said interactive user's response to each of said plurality of 
question types, for determining whether a subsequent question should be asked 
and means for formulating the subsequent question to ask. 

9 . A standalone interactive toy according to claim 8 wherein said interactive 
user's response includes one of a group of responses including non-response or 
non-reaction within a pre-determined period 

10 . A standalone interactive toy according to claim 1 and further comprising 
means for determining the subsequent step to take after a specific activity has 
ended and to decide whether and how to interrupt an ongoing activity. 

1 1 . A method of interactive communication between an adaptive standalone 
interactive toy and a user, said method comprising the steps of: 

creating a plurality of characteristics, said plurality of 
characteristics representing at least one of a group including a 
plurality of personality characteristics of said toy, a plurality of 
personality characteristics of an interactive user and a plurality of 
interaction characterizations between said user and said toy; 

creating a current personality characteristic for said toy; 

receiving and characterizing the vocal input from said user; 

storing said current personality characteristic of said toy and said 
plurality of characteristics; 
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generating a response, said generated response being responsive 
to said characterized vocal input and said current personality 
characteristic of said toy; and 

outputting said response. 

5 12 . A method of interactive communication according to claim 1 1 and further 
comprising the step of amending said plurality of characteristic means according 
to at least one of a group of environment conditions including random changes, 
time based changes, direct and indirect feedback from said interacting user, 
periodic changes and programmed changes. 

10 13 . A method of interactive communication according to claim 1 1 and further 
comprising the step of decision making and wherein the step of generating a 
response is responsive to the step of decision making. 

14 . A method of interactive communication according to claim 13 wherein the 
decision making step is adjustable in accordance with the vocal input from said 

15 user. 

15 . A method of interactive communication according to claim 13 wherein said 
decision making step is adjustable in accordance with said usage of each of 
said plurality of characteristics. 

16 . A method of interactive communication according to claim 13 wherein 
20 said decision making step is determined by one of a group of conditions 

including randomization, weighted randomization, the usage of each of said 
plurality of characteristics and on the basis of which said plurality of 
characteristics has been recently selected. 

17 . A method of interactive communication according to claim 1 1 and further 
25 comprising the steps of: 

determining whether a subsequent question should be asked; and 
if so, determining which of a plurality of question types is to be 
asked by said toy. 

18 . A method of interactive communication according to claim 17 and further 
30 comprising the step of assessing said interactive user's response to each of 
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said plurality of question types and the step of formulating the subsequent 
question to ask. 

19 . A method of interactive communication according to claim 18 wherein 
said interactive user's response includes one of a group of responses including 

5 non- response or non-reaction within a pre-determined period. 

20 . A method according to claim 1 1 and further comprising the step of 
determining what step to take after a specific activity has ended. 

21 . A method according to claim 1 1 and further comprising the step of 
determining whether and how to interrupt an ongoing activity. 
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